K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 72 | 96 | 99 | 99 | 99 |
1000 | 395 | 755 | 943 | 990 | 999 |
10000 | 2017 | 6108 | 8762 | 9628 | 9867 |
100000 | 4729 | 24472 | 54740 | 77773 | 87259 |
1000000 | 5998 | 34847 | 83858 | 126996 | 148230 |
Both the problem and the results are very similar to those of the previous subsection: here we consider letter-N-grams at the end of words instead of at the beginning.
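As a rough illustration of the counting involved, the following minimal Python sketch counts the distinct letter-N-grams found at the ends of a list of words. The function name `count_ending_ngrams` and the toy word list are hypothetical. The handling of words shorter than N (here the whole word is used) is an assumption, since the text does not say how such words are treated.

```python
def count_ending_ngrams(words, n):
    """Return the number of distinct letter-N-grams at the ends of `words`."""
    # word[-n:] yields the whole word when it has fewer than n letters;
    # this treatment of short words is an assumption, not taken from the text.
    return len({word[-n:] for word in words})


# Hypothetical toy list standing in for a frequency-ranked word list.
sample_words = ["walking", "talking", "singing", "walked", "talked", "the"]

# Distinct ending N-grams for N = 2..6, analogous to one row of such a table.
counts = {n: count_ending_ngrams(sample_words, n) for n in range(2, 7)}
print(counts)
```

For a real word list, the same function would be applied to the first K entries for each K of interest. Replacing `word[-n:]` with `word[:n]` gives the corresponding count for word beginnings.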
3.8.1 Number of letter-N-grams at word beginnings